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NOVEL HUMAN ENZYMES AND 
POLYNUCLEOTIDES ENCODING THE SAME 

The present application claims the benefit of U.S. 
5 Provisional Application Number 60/179,000 which was filed on 
January 28, 2000 and is herein incorporated by reference in its 
entirety. 



1 . INTRODUCTION 
10 The present invention relates to the discovery, 

identification, and characterization of novel human 

polynucleotides encoding proteins sharing sequence similarity with 
mammalian enzymes. The invention encompasses the described 
polynucleotides, host cell expression systems, the encoded 

15 protein, fusion proteins, polypeptides and peptides, antibodies to 
the encoded proteins and peptides, and sequencetically engineered 
animals that either lack or over express the disclosed sequences, 
antagonists and agonists of the proteins, and other compounds that 
modulate the expression or activity of the proteins encoded by the 

20 disclosed polynucleotides that can be used for diagnosis, drug 
screening, clinical trial monitoring and the treatment of 
physiological disorders. 



2. BACKGROUND OF THE INVENTION 
25 Enzymes are biological catalysts that modify biological 

substrates, including proteins, as part of degradation, 
maturation, catabolic, metabolic, differentiation, and secretory 
pathways within the body. Enzyme abnormalities have thus been 
associated with, inter alia, growth, development, protein and 
30 cellular senescence, cancer, or other diseases. 



3. SUMMARY OF THE INVENTION 
The present invention relates to the discovery, 
identification, and characterization of nucleotides that encode 

1 
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novel human proteins, and the corresponding amino acid sequences 
of these proteins. The novel human proteins (NHPs) described for 
the first time herein share structural similarity with nitrilase 
proteins from a wide variety of living organisms. 
5 The novel human nucleic acid (cDNA) sequences described 

herein, encode proteins/open reading frames (ORFs) of 276, 159, 
121, 168, 130, 152, and 285 amino acids in length (see 
respectively SEQ ID NOS: 2, 4, 6, 8, 10, 12 and 14). 

The invention also encompasses agonists and antagonists of 

10 the described NHPs including small molecules, large molecules, 
mutant NHPs, or portions thereof that compete with native NHPs, 
NHP peptides, and antibodies, as well as nucleotide sequences that 
can be used to inhibit the expression of the described NHPs (e.g., 
antisense and ribozyme molecules, and gene or regulatory sequence 

15 replacement constructs) or to enhance the expression of the 
described NHPs (e.g., expression constructs that place the 
described sequence under the control of a strong promoter system) , 
and transgenic animals that express a NHP transgene, or "knock- 
outs" (which can be conditional) that do not express a functional 

20 NHP. 

Further, the present invention also relates to processes for 
identifying compounds that modulate, i.e., act as agonists or 
antagonists, of NHP expression and/or NHP activity that utilize 
purified preparations of the described NHP and/or NHP product, or 
25 cells expressing the same. Such compounds can be used as 

therapeutic agents for the treatment of any of a wide variety of 
symptoms associated with biological disorders or imbalances. 

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
30 The Sequence Listing provides the sequences of the NHP ORFs 

encoding the described NHP amino acid sequences. SEQ ID NOS: 15 
describe representative a nitrilase-like ORF with flanking 
sequences . 



2 
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5. DETAILED DESCRIPTION OF THE INVENTION 
The NHPs, described for the first time herein, are novel 
proteins that are expressed in, inter alia, human cell lines, gene 
trapped cells and human brain, fetal brain, pituitary, cerebellum, 
5 spinal cord, thymus, spleen, lymph node, bone marrow, trachea, 
lung, kidney, fetal liver, liver, prostate, testis, thyroid, 
adrenal gland, pancreas, salivary gland, stomach, small intestine, 
colon, skeletal muscle, heart, placenta, mammary gland, adipose, 
skin, esophagus, bladder, pericardium, hypothalamus, ovary, fetal 

10 kidney, and fetal lung cells. 

The described sequences were compiled from gene trapped cDNAs 
and clones isolated from human prostate, lymph node, pituitary, 
mammary gland, and kidney cDNA library (Edge Biosystems, 
Gaithersburg, MD) . The present invention encompasses the 

15 nucleotides presented in the Sequence Listing, host cells 

expressing such nucleotides, the expression products of such 
nucleotides, and: (a) nucleotides that encode mammalian homologs 
of the described sequences, including the specifically described 
NHPs, and the NHP products; (b) nucleotides that encode one or 

20 more portions of a NHP that correspond to functional domains of 

the NHP, and the polypeptide products specified by such nucleotide 
sequences, including but not limited to the novel regions of any 
active domain (s); (c) isolated nucleotides that encode mutant 
versions, engineered or naturally occurring, of a described NHP in 

25 which all or a part of at least one domain is deleted or altered, 
and the polypeptide products specified by such nucleotide 
sequences, including but not limited to soluble proteins and 
peptides in which all or a portion of the signal sequence is 
deleted; (d) nucleotides that encode chimeric fusion proteins 

30 containing all or a portion of a coding region of a NHP, or one of 
its domains (e.g., a receptor or ligand binding domain, accessory 
protein/self -association domain, etc.) fused to another peptide or 
polypeptide; or (e) therapeutic or diagnostic derivatives of the 
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described polynucleotides such as oligonucleotides, antisense 
polynucleotides, ribozymes, dsRNA, or gene therapy constructs 
comprising a sequence first disclosed in the Sequence Listing. 

As discussed above, the present invention includes: (a) the 
5 human DNA sequences presented in the Sequence Listing (and vectors 
comprising the same) and additionally contemplates any nucleotide 
sequence encoding a contiguous NHP open reading frame (ORF) , or a 
contiguous exon splice junction first described in the Sequence 
Listing, that hybridizes to a complement of a DNA sequence 

10 presented in the Sequence Listing under highly stringent 

conditions, e.g., hybridization to filter-bound DNA in 0.5 M 
NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and 
washing in 0.1xSSC/0.1% SDS at 68°C (Ausubel F.M. et al., eds . , 
1989, Current Protocols in Molecular Biology, Vol. I, Green 

15 Publishing Associates, Inc., and John Wiley & sons, Inc., New 
York, at p. 2.10.3) and encodes a functionally equivalent gene 
product. Additionally contemplated are any nucleotide sequences 
that hybridize to the complement of the DNA sequence that encode 
and express an amino acid sequence presented in the Sequence 

20 Listing under moderately stringent conditions, e.g., washing in 
0.2xSSC/0.1% SDS at 42°C (Ausubel et al . , 1989, supra) , yet still 
encode a functionally equivalent NHP product. Functional 
equivalents of a NHP include naturally occurring NHPs present in 
other species and mutant NHPs whether naturally occurring or 

25 engineered (by site directed mutagenesis, gene shuffling, directed 
evolution as described in, for example, U.S. Patent No. 
5,837,458), The invention also includes degenerate nucleic acid 
variants of the disclosed NHP polynucleotide sequences. 

Additionally contemplated are polynucleotides encoding a NHP 

30 ORF, or its functional equivalent, encoded by a polynucleotide 

sequence that is about 99, 95, 90, or about 85 percent similar or 
identical to corresponding regions of the nucleotide sequences of 
the Sequence Listing (as measured by BLAST sequence comparison 

4 
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analysis using, for example, the GCG sequence analysis package 
using standard default settings) . 

The invention also includes nucleic acid molecules, 
preferably DNA molecules, that hybridize to, and are therefore the 
5 complements of, the described NHP nucleotide sequences. Such 
hybridization conditions may be highly stringent or less highly 
stringent, as described above. In instances where the nucleic 
acid molecules are deoxyoligonucleotides ("DNA oligos"), such 
molecules are generally about 16 to about 100 bases long, or about 

10 20 to about 80, or about 34 to about 45 bases long, or any 
variation or combination of sizes represented therein that 
incorporate a contiguous region of sequence first disclosed in the 
Sequence Listing. Such oligonucleotides can be used in 
conjunction with the polymerase chain reaction (PCR) to screen 

15 libraries, isolate clones, and prepare cloning and sequencing 
templates, etc. 

Alternatively, such NHP oligonucleotides can be used as 
hybridization probes for screening libraries, and assessing gene 
expression patterns (particularly using a micro array or high- 

20 throughput "chip" format) . Additionally, a series of the 
described NHP oligonucleotide sequences, or the complements 
thereof, can be used to represent all or a portion of the 
described NHP sequences. An oligonucleotide or polynucleotide 
sequence first disclosed in at least a portion of one or more of 

25 the sequences of SEQ ID NOS: 1-15 can be used as a hybridization 
probe in conjunction with a solid support matrix/substrate 
(resins, beads, membranes, plastics, polymers, metal or metallized 
substrates, crystalline or polycrystalline substrates, etc.). Of 
particular note are spatially addressable arrays (i.e., gene 

30 chips, microtiter plates, etc.) of oligonucleotides and 

polynucleotides, or corresponding oligopeptides and polypeptides, 
wherein at least one of the biopolymers present on the spatially 
addressable array comprises an oligonucleotide or polynucleotide 

5 



WO 01/94566 



PC17US01/26687 



sequence first disclosed in at least one of the sequences of SEQ 
ID NOS: 1-15, or an amino acid sequence encoded thereby. Methods 
for attaching biopolymers to, or synthesizing biopolymers on, 
solid support matrices, and conducting binding studies thereon are 
5 disclosed in, inter alia, U.S. Patent Nos. 5,700,637, 5,556,752, 
5,744,305, 4,631,211, 5,445,934, 5,252,743, 4,713,326, 5,424,186, 
and 4,689,405 the disclosures of which are herein incorporated by 
reference in their entirety. 

Addressable arrays comprising sequences first disclosed in 

10 SEQ ID NOS: 1-15 can be used to identify and characterize the 
temporal and tissue specific expression of a gene. These 
addressable arrays incorporate oligonucleotide sequences of 
sufficient length to confer the required specificity, yet be 
within the limitations of the production technology. The length 

15 of these probes is within a range of between about 8 to about 2000 
nucleotides. Preferably the probes consist of 60 nucleotides and 
more preferably 25 nucleotides from the sequences first disclosed 
in SEQ ID NOS: 1-15. 

For example, a series of the described oligonucleotide 

20 sequences, or the complements thereof, can be used in chip format 
to represent all or a portion of the described sequences. The 
oligonucleotides, typically between about 16 to about 40 (or any 
whole number within the stated range) nucleotides in length can 
partially overlap each other and/or the sequence may be 

25 represented using oligonucleotides that do not overlap. 
Accordingly, the described polynucleotide sequences shall 
typically comprise at least about two or three distinct 
oligonucleotide sequences of at least about 8 nucleotides in 
length that are each first disclosed in the described Sequence 

30 Listing. Such oligonucleotide sequences can begin at any 

nucleotide present within a sequence in the Sequence Listing and 
proceed in either a sense (B'-to-S') orientation vis-a-vis the 
described sequence or in an antisense orientation. 

6 
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Microarray-based analysis allows the discovery of broad 
patterns of genetic activity, providing new understanding of gene 
functions and generating novel and unexpected insight into 
transcriptional processes and biological mechanisms. The use of 
5 addressable arrays comprising sequences first disclosed in SEQ ID 
NOS:l-15 provides detailed information about transcriptional 
changes involved in a specific pathway, potentially leading to the 
identification of novel components or gene functions that manifest 
themselves as novel phenotypes. 

10 Probes consisting of sequences first disclosed in SEQ ID 

NOS:l-15 can also be used in the identification, selection and 
validation of novel molecular targets for drug discovery. The use 
of these unique sequences permits the direct confirmation of drug 
targets and recognition of drug dependent changes in gene 

15 expression that are modulated through pathways distinct from the 
drugs intended target. These unique sequences therefore also have 
utility in defining and monitoring both drug action and toxicity. 

As an example of utility, the sequences first disclosed in 
SEQ ID NOS:l-15 can be utilized in microarrays or other assay 

20 formats, to screen collections of genetic material from patients 
who have a particular medical condition. These investigations can 
also be carried out using the sequences first disclosed in SEQ ID 
NOS:l-15 in silico and by comparing previously collected genetic 
databases and the* disclosed sequences using computer software 

25 known to those in the art. 

Thus the sequences first disclosed in SEQ ID NOS:l-15 can be 
used to identify mutations associated with a particular disease 
and also as a diagnostic or prognostic assay. 

Although the presently described sequences have been 

30 specifically described using nucleotide sequence, it should be 
appreciated that each of the sequences can uniquely be described 
using any of a wide variety of additional structural attributes, 
or combinations thereof. For example, a given sequence can be 

7 
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described by the net composition of the nucleotides present within 
a given region of the sequence in conjunction with the presence of 
one or more specific oligonucleotide' sequence (s) first disclosed 
in the SEQ ID NOS: 1-15. Alternatively, a restriction map 
5 specifying the relative positions of restriction endonuclease 
digestion sites, or various palindromic or other specific 
oligonucleotide sequences can be used to structurally describe a 
given sequence- Such restriction maps, which are typically 
generated by widely available computer programs (e.g., the 

10 University of Wisconsin GCG sequence analysis package, SEQUENCHER 
3.0, Gene Codes Corp., Ann Arbor, MI, etc.), can optionally be 
used in conjunction with one or more discrete nucleotide 
sequence (s) present in the sequence that can be described by the 
relative position of the sequence relatve to one or more 

15 additional sequence (s) or one or more restriction sites present in 
the disclosed sequence. 

For oligonucleotide probes, highly stringent conditions may 
refer, for example, to washing in 6xSSC/0.05% sodium pyrophosphate 
at 37°C (for 14-base oligos) , 48°C (for 17-base oligos) , 55°C (for 

20 20-base oligos), and 60°C (for 23-base oligos). These nucleic 

acid molecules may encode or act as NHP gene antisense molecules, 
useful, for example, in NHP gene regulation (for and/or as 
antisense primers in amplification reactions of NHP nucleic acid 
sequences) . With respect to NHP gene regulation, such techniques 

25 can be used to regulate biological functions. Further, such 
sequences may be used as part of ribozyme and/or triple helix 
sequences that are also useful for NHP gene regulation. 

Inhibitory antisense or double stranded oligonucleotides can 
additionally comprise at least one modified base moiety which is 

30 selected from the group including but not limited to 

5-f luorouracil , 5-bromouracil , 5-chlorouracil , 5-iodouracil , 
hypoxanthine, xantine, 4-acetylcytosine, 5- (carboxyhydroxylmethyl) 
uracil , 5-carboxymethylaminomethyl-2-thiouridine , 

8 
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5-carboxymethylaminomethyluracil, dihydrouracil , beta-D- 
galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2, 2-dimethylguanine, 

2- methyladenine, 2-methylguanine, 3~methylcytosine, 
5 5-methylcytosine, N6-adenine, 7-methylguanine, 

5-methylaminomethyluracil, 5-methoxyaminomethyl-2--thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5- 
oxyacetic acid (v) , wybutoxosine, pseudouracil, queosine, 

10 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 
5-methyluracil, uracil-5-oxyacetic acid methylester, uracil- 
5-oxyacetic acid (v) , 5-methyl-2-thiouracil, 3- (3-amino-3~N~2- 
carboxypropyl) uracil, (acp3)w, and 2, 6-diaminopurine . 

The antisense oligonucleotide can also comprise at least one 

15 modified sugar moiety selected from the group including but not 
limited to arabinose, 2-f luoroarabinose, xylulose, and hexose. 

In yet another embodiment, the antisense oligonucleotide will 
comprise at least one modified phosphate backbone selected from 
the group consisting of a phosphorothioate, a phosphorodithioate, 

2 0 a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a 
methylphosphonate, an alkyl phosphotriester , and a formacetal or 
analog thereof. 

In yet another embodiment, the antisense oligonucleotide is 
an a-anomeric oligonucleotide. An a-anomeric oligonucleotide 

25 forms specific double-stranded hybrids with complementary RNA in 
which, contrary to the usual p-units, the strands run parallel to 
each other (Gautier et al., 1987, Nucl. Acids Res. 15:6625-6641). 
The oligonucleotide is a 2'-0-methylribonucleotide (Inoue et al., 
1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 

30 analogue (Inoue et al. f 1987, FEBS Lett. 215:327-330). 

Alternatively, double stranded RNA can be used to disrupt the 
expression and function of a targeted NHP. 

9 



WO 01/94566 



PCT/US01/26687 



Oligonucleotides of the invention can be synthesized by 
standard methods known in the art, e.g. by use of an automated DNA 
synthesizer (such as are commercially available from Biosearch, 
Applied Biosystems, etc.). As examples, phosphorothioate 
5 oligonucleotides can be synthesized by the method of Stein et al. 
(1988, Nucl. Acids Res. 16:3209), and methylphosphonate 
oligonucleotides can be prepared by use of controlled pore glass 
polymer supports (Sarin et al., 1988, Proc. Natl. Acad. Sci. 
U.S.A. 85:7448-7451), etc. 

10 Low stringency conditions are well known to those of skill in 

the art, and will vary predictably depending on the specific 
organisms from which the library and the labeled sequences are 
derived. For guidance regarding such conditions see, for example, 
Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual (and 

15 periodic updates thereof), Cold Springs Harbor Press, N.Y.; and 
Ausubel et al., 1989, Current Protocols in Molecular Biology, 
Green Publishing Associates and Wiley Interscience, N.Y. 

Alternatively, suitably labeled NHP nucleotide probes can be 
used to screen a human genomic library using appropriately 

20 stringent conditions or by PCR. The identification and 
characterization of human genomic clones is helpful for 
identifying polymorphisms (including, but not limited to, 
nucleotide repeats, microsatellite alleles, single nucleotide 
polymorphisms, or coding single nucleotide polymorphisms) , 

25 determining the genomic structure of a given locus/allele, and 
designing diagnostic tests. For example, sequences derived from 
regions adjacent to the intron/exon boundaries of the human gene 
can be used to design primers for use in amplification assays to 
detect mutations within the exons, introns, splice sites (e.g., 

30 splice acceptor and/or donor sites), etc., that can be used in 
diagnostics and pharmacogenomics . 

Further, a NHP homolog can be isolated from nucleic acid from 
an organism of interest by performing PCR using two degenerate or 

10 
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^wobble" oligonucleotide primer pools designed on the basis of 
amino acid sequences within the NHP products disclosed herein. 
The template for the reaction may be total RNA, mRNA r and/or cDNA 
obtained by reverse transcription of mRNA prepared from human or 
5 non-human cell lines or tissue known or suspected to express an 
allele of a NHP gene. 

The PCR product can be subcloned and sequenced to ensure that 
the amplified sequences represent the sequence of the desired NHP 
sequence. The PCR fragment can then be used to isolate a full 

10 length cDNA clone by a variety of methods. For example, the 
amplified fragment can be labeled and used to screen a cDNA 
library, such as a bacteriophage cDNA library. Alternatively, the 
labeled fragment can be used to isolate genomic clones via the 
screening of a genomic library. 

15 PCR technology can also be used to isolate full length cDNA 

sequences. For example, RNA can be isolated, following standard 
procedures, from an appropriate cellular or tissue source (i.e., 
one known, or suspected, to express a NHP sequence, such as, for 
example, testis tissue) . A reverse transcription (RT) reaction 

20 can be performed on the RNA using an oligonucleotide primer 

specific for the most 5' end of the amplified fragment for the 
priming of first strand synthesis. The resulting RNA/DNA hybrid 
may then be "tailed" using a standard terminal transferase 
reaction, the hybrid may be digested with RNase H, and second 

25 strand synthesis may then be primed with a complementary primer. 
Thus, cDNA sequences upstream of the amplified fragment can be 
isolated. For a review of cloning strategies that can be used, 
see e.g., Sambrook et al., 1989, supra. 

A cDNA encoding a mutant NHP sequence can be isolated, for 

30 example, by using PCR. In this case, the first cDNA strand may be 
synthesized by hybridizing an oligo-dT oligonucleotide to mRNA 
isolated from tissue known or suspected to be expressed in an 
individual putatively carrying a mutant NHP allele, and by 



11 



WO 01/94566 



PCT/USO 1/26687 



extending the new strand with reverse transcriptase. The second 
strand of the cDNA is then synthesized using an oligonucleotide 
that hybridizes specifically to the 5 1 end of the normal gene. 
Using these two primers , the product is then amplified via PCR, 
5 optionally cloned into a suitable vector, and subjected to DNA 
sequence analysis through methods well known to those of skill in 
the art. By comparing the DNA sequence of the mutant NHP allele 
to that of a corresponding normal NHP allele, the mutation (s) 
responsible for the loss or alteration of function of the mutant 

10 NHP gene product can be ascertained. 

Alternatively, a genomic library can be constructed using DNA 
obtained from an individual suspected of or known to carry a 
mutant NHP allele (e.g., a person manifesting a NHP-associated 
phenotype such as, for example, obesity, high blood pressure, 

15 connective tissue disorders, infertility, etc.), or a cDNA library 
can be constructed using RNA from a tissue known, or suspected, to 
express a mutant NHP allele. A normal NHP sequence, or any 
suitable fragment thereof, can then be labeled and used as a probe 
to identify the corresponding mutant NHP allele in such libraries. 

20 Clones containing mutant NHP sequences can then be purified and 
subjected to sequence analysis according to methods well known to 
those skilled in the art. 

Additionally, an expression library can be constructed 
utilizing cDNA synthesized from, for example, RNA isolated from a 

25 tissue known, or suspected, to express a mutant NHP allele in an 
individual suspected of or known to carry such a mutant allele. 
In this manner, gene products made by the putatively mutant tissue 
can be expressed and screened using standard antibody screening 
techniques in conjunction with antibodies raised against normal 

30 NHP product, as described below. (For screening techniques, see, 
for example, Harlow, E. and Lane, eds . , 1988, "Antibodies: A 
Laboratory Manual", Cold Spring Harbor Press, Cold Spring Harbor.) 



12 
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Additionally, screening can be accomplished by screening with 
labeled NHP fusion proteins, such as, for example, alkaline 
phosphatase-NHP or NHP-alkaline phosphatase fusion proteins. 
In cases where a NHP mutation results in an expressed gene product 
5 with altered function (e.g., as a result of a missense or a 

frameshift mutation) , polyclonal antibodies to NHP are likely to 
cross-react with a corresponding mutant NHP gene product. Library 
clones detected via their reaction with such labeled antibodies 
can be purified and subjected to sequence analysis according to 

10 methods well known in the art. 

The invention also encompasses (a) DNA vectors that contain 
any of the foregoing NHP coding sequences and/or their complements 
(i.e., antisense) ; (b) DNA expression vectors that contain any of 
the foregoing NHP coding sequences operatively associated with a 

15 regulatory element that directs the expression of the coding 

sequences (for example, baculo virus as described in U.S. Patent 
No. 5,869,336 herein incorporated by reference); (c) genetically 
engineered host cells that contain any of the foregoing NHP coding 
sequences operatively associated with a regulatory element that 

20 directs the expression of the coding sequences in the host cell; 
and (d) genetically engineered host cells that express an 
endogenous NHP sequence under the control of an exogenously 
introduced regulatory element (i.e., gene activation). As used 
herein, regulatory elements include, but are not limited to, 

25 inducible and non-inducible promoters, enhancers, operators and 
other elements known to those skilled in the art that drive and 
regulate expression. Such regulatory elements include but are not 
limited to the cytomegalovirus (hCMV) immediate early gene, 
regulatable, viral elements (particularly retroviral LTR 

30 promoters), the early or late promoters of SV40 adenovirus, the 
lac system, the trp system, the TAC system, the TRC system, the 
major operator and promoter regions of phage lambda, the control 
regions of fd coat protein, the promoter for 3-phosphoglycerate 
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kinase (PGK) , the promoters of acid phosphatase, and the promoters 
of the yeast a-mating factors. 

The present invention also encompasses antibodies and anti- 
idiotypic antibodies (including Fab fragments) , antagonists and 
5 agonists of a NHP, as well as compounds or nucleotide constructs 
that inhibit expression of a NHP gene (transcription factor 
inhibitors, antisense and ribozyme molecules, or gene or 
regulatory sequence replacement constructs) r or promote the 
expression of a NHP (e.g., expression constructs in which NHP 

10 coding sequences are operatively associated with expression 

control elements such as promoters, promoter /enhancers, etc.). 

The NHPs or NHP peptides, NHP fusion proteins, NHP nucleotide 
sequences, antibodies, antagonists and agonists can be useful for 
the detection of mutant NHPs or inappropriately expressed NHPs for 

15 the diagnosis of disease. The NHP proteins or peptides, NHP 

fusion proteins, NHP nucleotide sequences, host cell expression 
systems, antibodies, antagonists, agonists and genetically 
engineered cells and animals can be used for screening for drugs 
(or high throughput screening of combinatorial libraries) 

20 effective in the treatment of the symptomatic or phenotypic 

manifestations of perturbing the normal function of a NHP in the 
body. The use of engineered host cells and/or animals may offer 
an advantage in that such systems allow not only for the 
identification of compounds that bind to the endogenous receptor 

25 for a NHP, but can also identify compounds that trigger NHP- 
mediated activities or pathways. 

Finally, the NHP products can be used as therapeutics. For 
example, soluble derivatives such as NHP peptides /domains 
corresponding to NHP, NHP fusion protein products (especially NHP- 

30 Ig fusion proteins, i.e., fusions of a NHP, or a domain of a NHP, 
to an IgFc) , NHP antibodies and anti-idiotypic antibodies 
(including Fab fragments) , antagonists or agonists (including 
compounds that modulate or act on downstream targets in a NHP- 
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mediated pathway) can be used to directly treat diseases or 
disorders. For instance, the administration of an effective 
amount of soluble NHP, or a NHP-IgFc fusion protein or an anti- 
idiotypic antibody (or its Fab) that mimics the NHP could activate 
5 or effectively antagonize the endogenous NHP receptor. Nucleotide 
constructs encoding such NHP products can be used to genetically 
engineer host cells to express such products in vivo; these 
genetically engineered cells function as "bioreactors" in the body 
delivering a continuous supply of a NHP, a NHP peptide, or a NHP 

10 fusion protein to the body. Nucleotide constructs encoding 

functional NHP, mutant NHPs , as well as antisense and ribozyme 
molecules can also be used in "gene therapy" approaches for the 
modulation of NHP expression. Thus, the invention also 
encompasses pharmaceutical formulations and methods for treating 

15 biological disorders. 

Various aspects of the invention are described in greater 
detail in the subsections below. 

5.1 THE NHP SEQUENCES 

20 The cDNA sequences and the corresponding deduced amino acid 

sequences of the described NHP are presented in the Sequence 
Listing. SEQ ID NOS:15 describes the NHP ORFs as well as flanking 
regions. The NHP nucleotides were obtained from human cDNA 
libraries using probes and/or primers generated from human gene 

25 trapped sequence tags. Expression analysis has provided evidence 
that the described NHP can be expressed in a variety of human 
cells as well as gene trapped human cells. 

5.2 NHP AND NHP POLYPEPTIDES 
30 NHPs, polypeptides, peptide fragments, mutated, truncated, or 

deleted forms of the NHPs, and/or NHP fusion proteins can be 
prepared for a variety of uses. These uses include, but are not 
limited to, the generation of antibodies, as reagents in 



15 



WO 01/94566 



PCT/US01/26687 



diagnostic assays, for the identification of other cellular gene 
products related to a NHP f as reagents in assays for screening for 
compounds that can be as pharmaceutical reagents useful in the 
therapeutic treatment of mental, biological, or medical disorders 
5 and disease. 

The Sequence Listing discloses the amino acid sequence 
encoded by the described NHP polynucleotides. The NHPs display 
initiator methionines in DNA sequence contexts consistent with 
translation initiation sites, and apparently display signal 

10 sequences which can indicate that the described NHP ORFs are 
secreted proteins or possibly membrane associated. 

The NHP amino acid sequences of the invention include the 
amino acid sequences presented in the Sequence Listing as well as 
analogues and derivatives thereof, as well as any oligopeptide 

15 sequence of at least about 10-40, generally about 12-35, or about 
16-30 amino acids in length first disclosed in the Sequence 
Listing- Further, corresponding NHP homologues from other species 
are encompassed by the invention. In fact, any NHP encoded by the 
NHP nucleotide sequences described above are within the scope of 

20 the invention, as are any novel polynucleotide sequences encoding 
all or any novel portion of an amino acid sequence presented in 
the Sequence Listing. The degenerate nature of the genetic code 
is well known, and, accordingly, each amino acid presented in the 
Sequence Listing, is generically representative of the well known 

25 nucleic acid ^triplet" codon, or in many cases codons, that can 

encode the amino acid. As such, as contemplated herein, the amino 
acid sequences presented in the Sequence Listing, when taken 
together with the genetic code (see, for example, Table 4-1 at 
page 109 of "Molecular Cell Biology", 1986, J. Darnell et al. 

30 eds., Scientific American Books, New York, NY, herein incorporated 
by reference) are generically representative of all the various 
permutations and combinations of nucleic acid sequences that can 
encode such amino acid sequences. 
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The invention also encompasses proteins that are functionally 
equivalent to the NHPs encoded by the presently described 
nucleotide sequences as judged by any of a number of criteria, 
including, but not limited to, the ability to bind and cleave a 
5 substrate of a NHP, or the ability to effect an identical or 
complementary downstream pathway, or a. change in cellular 
metabolism (e.g., proteolytic activity, ion flux, tyrosine 
phosphorylation, etc. ) . Such functionally equivalent NHP proteins 
include, but are not limited to, additions or substitutions of 

10 amino acid residues within the amino acid sequence encoded by the 
NHP nucleotide sequences described above, but which result in a 
silent change, thus producing a functionally equivalent gene 
product. Amino acid substitutions can be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity*, 

15 hydrophilicity, and/or the amphipathic nature of the residues 

involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

20 glutamine; positively charged (basic) amino acids include 

arginine, lysine, and histidine; and negatively charged (acidic) 
amino acids include aspartic acid and glutamic acid. 

A variety of host-expression vector systems can be used to 
express the NHP nucleotide sequences of the invention. Where, as 

25 in the present instance, the NHP products or NHP polypeptides can 
be produced in soluble or secreted forms (by removing one or more 
transmembrane domains where applicable) , the peptide or 
polypeptide can be recovered from the culture media. Such 
expression systems also encompass engineered host cells that 

30 express a NHP, or a functional equivalent, in situ. Purification 
or enrichment of NHP from such expression systems can be 
accomplished using appropriate detergents and lipid micelles and 
methods well known to those skilled in the art. However, such 
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engineered host cells themselves may be used in situations where 
it is important not only to retain the structural and functional 
characteristics of the NHP, but to assess biological activity, 
e.g., in drug screening assays. 



invention include but are not limited to microorganisms such as 
bacteria (e.g., E. coli, B. subtilis) transformed with recombinant 
bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 
containing NHP nucleotide sequences; yeast (e.g., Saccharomyces, 

10 Pichla) transformed with recombinant yeast expression vectors 

containing NHP encoding nucleotide sequences; insect cell systems 
infected with recombinant virus expression vectors (e.g., 
baculovirus) containing NHP sequences; plant cell systems infected 
with recombinant virus expression vectors (e.g., cauliflower 

15 mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with 
recombinant plasmid expression vectors (e.g., Ti plasmid) 
containing NHP nucleotide sequences; or mammalian cell systems 
(e.g., COS, CHO, BHK, 293, 3T3) harboring recombinant expression 
constructs containing promoters derived from the genome of 

.20 mammalian cells (e.g., metallothionein promoter) or from mammalian 
viruses (e.g., the adenovirus late promoter; the vaccinia virus 
7 . 5K promoter) . 

In bacterial systems, a number of expression vectors may be 
advantageously selected depending upon the use intended for the 

25 NHP product being expressed. For example, when a large quantity 
of such a protein is to be produced for the generation of 
pharmaceutical compositions of or containing NHP, or for raising 
antibodies to a NHP, vectors that direct the expression of high 
levels of fusion protein products that are readily purified may be 

30 desirable. Such vectors include, but are not limited, to the E. 
coli expression vector pUR278 (Ruther et al., 1983, EMBO J. 
2:1791), in which a NHP coding sequence may be ligated 



5 



The expression systems that may be used for purposes of the 
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individually into the vector in frame with the lacZ coding region 
so that a fusion protein is produced; pIN vectors (Inouye & 
Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van Heeke & 
Schuster, 1989, J. Biol. Chem. 254:5503-5509); and the like. pGEX 
5 vectors (Pharmacia or American Type Culture Collection) can also 
be used to express foreign polypeptides as fusion proteins with 
glutathione S-transf erase (GST) . In general, such fusion proteins 
are soluble and can easily be purified from lysed cells by 
adsorption to glutathione-agarose beads followed by elution in the 

10 presence of free glutathione. The PGEX vectors are designed to 
include thrombin or factor Xa protease cleavage sites so that the 
cloned target gene product can be released from the GST moiety. 

In an insect system, Autographs calif ornica nuclear 
polyhidrosis virus (AcNPV) is used as a vector to express foreign 

15 sequences. The virus grows in Spodoptera frugiperda cells. A NHP 
coding sequence can be cloned individually into non-essential 
regions (for example the polyhedrin gene) of the virus and placed 
under control of an AcNPV promoter (for example the polyhedrin 
promoter) . Successful insertion of NHP coding sequence will 

20 result in inactivation of the polyhedrin gene and production of 
non-occluded recombinant virus (i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene) . These 
recombinant viruses are then used to infect Spodoptera frugiperda 
cells in which the inserted sequence is expressed (e.g., see 

25 Smith et al., 1983, J. Virol. 46: 584; Smith, U.S. Patent No. 
4,215,051) . 

In mammalian host cells, a number of viral-based expression 
systems may be utilized. In cases where an adenovirus is used as 
an expression vector, the NHP nucleotide sequence of interest may 
30 be ligated to an adenovirus transcription/translation control 

complex, e.g., the late promoter and tripartite leader sequence. 
This chimeric sequence may then be inserted in the adenovirus 
genome by in vitro or in vivo recombination. Insertion in a non- 
19 
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essential region of the viral genome (e.g., region El or E3) will 
result in a recombinant virus that is viable and -capable of 
expressing a NHP product in infected hosts (e.g., See Logan & 
Shenk, 1984, Proc. Natl. Acad. Sci. USA 51:3655-3659). Specific 
5 initiation signals may also be required for efficient translation 
of inserted NHP nucleotide sequences. These signals include the 
ATG initiation codon and adjacent sequences. In cases where an 
entire NHP sequence or cDNA, including its own initiation codon 
-and adjacent sequences, is inserted into the appropriate 

10 expression vector, no additional translational control signals may 
be needed. However, in cases where only a portion of a NHP coding 
sequence is inserted, exogenous translational control signals, 
including, perhaps, the ATG initiation codon, must be provided. 
Furthermore, the initiation codon must be in phase with the 

15 reading frame of the desired coding sequence to ensure translation 
of the entire insert. These exogenous translational control 
signals and initiation codons can be of a variety of origins, both 
natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of appropriate transcription enhancer 

20 elements, transcription terminators, etc. (See Bittner et al., 
1987, Methods in Enzymol. 153:516-544). 

In addition, a host cell strain may be chosen that modulates 
the expression of the inserted sequences, or modifies and 
processes the gene product in the specific fashion desired. Such 

25 modifications (e.g., glycosylation) and processing (e.g., 

cleavage) of protein products may be important for the function of 
the protein. Different host cells have characteristic and 
specific mechanisms for the post-translational processing and 
modification of proteins and gene products. Appropriate cell 

30 lines or host systems can be chosen to ensure the correct 

modification and processing of the foreign protein expressed- To 
this end, eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary transcript, 

20 
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glycosylation, and phosphorylation of the gene product may be 
used. Such mammalian host cells include, but are not limited to, 
CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and in 
particular, human cell lines. 
5 For long-term, high-yield production of recombinant proteins, 

stable expression is preferred. For example, cell lines which 
stably express the NHP sequences described above can be 
engineered. Rather than using expression vectors which contain 
viral origins of replication, host cells can be transformed with 

10 DNA controlled by appropriate expression control elements (e.g., 
promoter, enhancer sequences, transcription terminators, 
polyadenylation sites, etc.), and a selectable marker. Following 
the introduction of the foreign DNA, engineered cells may be 
allowed to grow for 1-2 days in an enriched media, and then are 

15 switched to a selective media. The selectable marker in the 

recombinant plasmid confers resistance to the selection and allows 
cells to stably integrate the plasmid into their chromosomes and 
grow to form foci which in turn can be cloned and expanded into 
cell lines. This method may advantageously be used to engineer 

20 cell lines which express the NHP product. Such engineered cell 
lines may be particularly useful in screening and evaluation of 
compounds that affect the endogenous activity of the NHP product. 

A number of selection systems may be used, including but not 
limited to the herpes simplex virus thymidine kinase (Wigler, et 

25 al., 1977, Cell 11:223), hypoxanthine-guanine 

phosphoribosyltransf erase (Szybalska & Szybalski, 1962, Proc. 
Natl., Acad. Sci. USA 48:2026), and adenine 

phosphoribosyltransf erase (Lowy, et al., 1980, Cell 22:817) genes 
can be employed in tk", hgprt" or aprt" cells, respectively. Also, 
30 antimetabolite resistance can be used as the basis of selection 
for the following genes: dhfr, which confers resistance to 
methotrexate (Wigler, et al., 1980, Natl. Acad. Sci. USA 77:3567; 
O'Hare, et ai., 1981, Proc. Natl. Acad. Sci. USA 78:1521); gpt, 
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which confers resistance to mycophenolic acid (Mulligan & Berg, 
1981, Proc. Natl. Acad. Sci. USA 78:2072); neo, which confers 
resistance to the aminoglycoside G-418 (Colberre-Garapin, et al. r 
1981, J. Mol. Biol, 150:1); and hygro, which confers resistance to 
5 hygromycin (Santerre, et al., 1984, Gene 30:147). 

Alternatively, any fusion protein can be readily purified by 
utilizing an antibody specific for the fusion protein being 
expressed. For example, a system described by Janknecht et al. 
allows for the ready purification of non-denatured fusion proteins 

10 expressed in human cell lines (Janknecht, et al., 1991, Proc. 

Natl. Acad. Sci. USA 58:8972-8976). In this system, the gene of 
interest is subcloned into a vaccinia recombination plasmid such 
that the gene's open reading frame is translationally fused to an 
amino-terminal tag consisting of six histidine residues. Extracts 

15 from cells infected with recombinant vaccinia virus are loaded 

onto Ni 2+ -nitriloacetic acid-agarose columns and histidine-tagged 
proteins are selectively eluted with imidazole-containing buffers. 

Also encompassed by the present invention are fusion proteins 
that direct the NHP to a target organ and/or facilitate transport 

20 across the membrane into the cytosol. Conjugation of NHPs to 

antibody molecules or their Fab fragments could be used to target 
cells bearing a particular epitope. Attaching the appropriate 
signal sequence to the NHP would also transport the NHP to the 
desired location within the cell. Alternatively targeting of NHP 

25 or its nucleic acid sequence might be achieved using liposome or 
lipid complex based delivery systems. Such technologies are 
described in Liposomes: A Practical Approach , New, RRC ed., Oxford 
University Press, New York and in U.S. Patents Nos. 4,594,595, 
5,459,127, 5,948,767 and 6,110,490 and their respective 

30 disclosures which are herein incorporated by reference in their 
entirety. Additionally embodied are novel protein constructs 
engineered in such a way that they facilitate transport of the NHP 
to the target site or desired organ, where they cross the cell 
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membrane and/or the nucleus where the NHP can exert its functional 
activity. This goal may be achieved by coupling of the NHP to a 
cytokine or other ligand that provides targeting specificity, 
and/or to a protein transducing domain (see generally U.S. 
5 applications Ser. No. 60/111,701 and 60/056,713, both of which are 
herein incorporated by reference, for examples of such transducing 
sequences) to facilitate passage across cellular membranes and can 
optionally be engineered to include nuclear localization 
sequences . 

10 5.3 ANTIBODIES TO NHP PRODUCTS 

Antibodies that specifically recognize one or more epitopes 
of a NHP, or epitopes of conserved variants of a NHP, or peptide 
fragments of a NHP are also encompassed by the invention. Such 
antibodies include but are not limited to polyclonal antibodies, 

15 monoclonal antibodies (mAbs), humanized or chimeric antibodies, 
single chain antibodies, Fab fragments, F(ab f ) 2 fragments, 
fragments produced by a Fab expression library, anti-idiotypic 
(anti-Id) antibodies, and epitope-binding fragments of any of the 
above . 

20 The antibodies of the invention may be used, for example, in 

the detection of NHP in a biological sample and may, therefore, be 
utilized as part of a diagnostic or prognostic technique whereby 
patients may be tested for abnormal amounts of NHP. Such 
antibodies may also be utilized in conjunction with, for example, 

25 compound screening schemes for the evaluation of the effect of 
test compounds on expression and/or activity of a NHP gene 
product. Additionally, such antibodies can be used in conjunction 
gene therapy to, for example, evaluate the normal and/or 
engineered NHP-expressing cells prior to their introduction into 

30 the patient. Such antibodies may additionally be used as a method 
for the inhibition of abnormal NHP activity. Thus, such 
antibodies may, therefore, be utilized as part of treatment 
methods . 
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For the production of antibodies, various host animals may be 
immunized by injection with the NHP, an NHP peptide (e.g., one 
corresponding to a functional domain of an NHP) , truncated NHP 
polypeptides (NHP in which one or more domains have been deleted) , 
5 functional equivalents of the NHP or mutated variant of the NHP. 
Such host animals may include but are not limited to pigs, 
rabbits, mice, goats, and rats, to name but a few. Various 
adjuvants may be used to increase the immunological response, 
depending on the host species, including but not limited to 

10 Freund's adjuvant (complete and incomplete), mineral salts such as 
aluminum hydroxide or aluminum phosphate, surface active 
substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, and potentially useful human adjuvants 
such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum. 

15 Alternatively, the immune response could be enhanced by 

combination and or coupling with molecules such as keyhole limpet 
hemocyanin, tetanus toxoid, diptheria toxoid, ovalbumin, cholera 
toxin or fragments thereof. Polyclonal antibodies are 
heterogeneous populations of antibody molecules derived from the 

20 sera of the immunized animals. 

Monoclonal antibodies, which are homogeneous populations of 
antibodies to a particular antigen, can be obtained by any 
technique which provides for the production of antibody molecules 
by continuous cell lines in culture. These include, but are not 

25 limited to, the hybridoma technique of Kohler and Milstein, (1975, 
Nature 255:495-497; and U.S. Patent No. 4,376,110), the human B- 
cell hybridoma technique (Kosbor et al . , 1983, Immunology Today 
4:72; Cole et al., 1983, Proc. Natl. Acad. Sci. USA 80:2026-2030), 
and the EBV-hybridoma technique (Cole et a 1 . r 1985, Monoclonal 

30 Antibodies And Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Such antibodies may be of any immunoglobulin class including IgG, 
IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma 
producing the mAb of this invention may be cultivated in vitro or 
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in vivo. Production of high titers of mAbs in vivo makes this the 
presently preferred method of production. 

In addition, techniques developed for the production of 
"chimeric antibodies" (Morrison et al., 1984, Proc. Natl. Acad. 
5 Sci., 81:6851-6855; Neuberger et al., 1984, Nature, 312:604-608; 
Takeda et al., 1985, Nature, 314:452-454) by splicing the genes 
from a mouse antibody molecule of appropriate antigen specificity 
together with genes from a human antibody molecule of appropriate 
biological activity can be used. A chimeric antibody is a 

10 molecule in which different portions are derived from different 
animal species, such as those having a variable region derived 
from a murine rtiAb and a human immunoglobulin constant region. Such 
technologies are described in U.S. Patents Nos. 6,075,181 and 
5,877,397 and their respective disclosures which are herein 

15 incorporated by reference in their entirety. Also favored is the 
production of fully humanized monoclonal antibodies as described 
in US Patent No. 6,150,584 and respective disclosures which are 
herein incorporated by reference in their entirety. 

Alternatively, techniques described for the production of 

20 single chain antibodies (U.S. Patent 4,946,778; Bird, 1988, 

Science 242:423-426? Huston et al., 1988, Proc. Natl. Acad. Sci. 
USA 85:5879-5883; and Ward et al., 1989, Nature 334:544-546) can 
be adapted to produce single chain antibodies against NHP gene 
products. Single chain antibodies are formed by linking the heavy 

25 and light chain fragments of the Fv region via an amino acid 
bridge, resulting in a single chain polypeptide. 

Antibody fragments which recognize specific epitopes may be 
generated by known techniques. For example, such fragments 
include, but are not limited to: the F(ab f ) 2 fragments which can 

30 be produced by pepsin digestion of the antibody molecule and the 
Fab fragments which can be generated by reducing the disulfide 
bridges of the F(ab ? ) 2 fragments. Alternatively, Fab expression 
libraries may be constructed (Huse et al. r 1989, Science, 

25 
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245:1275-1281) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 

Antibodies to a NHP can, in turn, be utilized to generate 
anti-idiotype antibodies that "mimic" a given NHP, using 
5 techniques well known to those skilled in the art. {See, e.g., 
Greenspan & Bona, 1993, FASEB J 7 (5) : 437-444 ; and Nissinoff, 1991, 
J. Immunol. 147 (8) : 2429-2438) . For example antibodies which bind 
to a NHP domain and competitively inhibit the binding of NHP to 
its cognate receptor can be used to generate anti-idiotypes that 

10 "mimic" the NHP and, therefore, bind and activate or neutralize a 
receptor. Such anti-idiotypic antibodies or Fab fragments of such 
anti-idiotypes can be used in therapeutic regimens involving a NHP 
signaling pathway. 

The present invention' is not to be limited in scope by the 

15 specific embodiments described herein, which are intended as 

single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the 
scope of the invention. Indeed, various modifications of the 
invention, in addition to those shown and described herein will 

20 become apparent to those skilled in the art from the foregoing 
description. Such modifications are intended to fall within the 
scope of the appended claims. All cited publications, patents, 
and patent applications are herein incorporated by reference in 
their entirety. 
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WHAT IS CLAIMED IS : 

1. An isolated nucleic acid molecule comprising at 
least 24 contiguous bases of nucleotide sequence first disclosed 

5 in SEQ ID NO: 1. 

2. An isolated nucleic acid molecule comprising a 
nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ID 
10 NO: 2; and 



amino acid sequence described in SEQ ID NO: 2. 

4. An isolated oligopeptide comprising at least about 
12 amino acids in a sequence first disclosed in SEQ ID NO: 2. 

20 



(b) 



hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID NO: 1 or the 
complement thereof . 



15 



3. 



An isolated nucleic acid molecule encoding the 
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SEQUENCE LISTING 



<110> LEXICON GENETICS INCORPORATED 



<120> Novel Human Enzymes and Polynucleotides 
Encoding the Same 



<130> LEX-0118-PCT 



<150> US 60/179,000 
<151> 2000-01-28 



<160> 15 



<170> FastSEQ for Windows Version 4.0 



<210> 1 
<211> 831 
<212> DNA 

<213> Homo sapiens 



<400> 1 

atgacctctt tccgcttggc cctcatccag cttcagattt cttccatcaa atcagataac 60 

gtcactcgcg cttgtagctt catccgggag gcagcaacgc aaggagccaa aatagtttct 120 

ttgccggaat gctttaattc tccatatgga gcgaaatatt ttcctgaata tgcagagaaa 180 

attcctggtg aatccacaca gaagctttct gaagtagcaa aggaatgcag catatatctc 24 0 

attggaggct ctatccctga agaggatgct gggaaattat ataacacctg tgctgtgttt 300 

gggcctgatg gaactttact agcaaagtat agaaagatcc atctgtttga cattgatgtt 360 

cctggaaaaa ttacatttca agaatctaaa acattgagtc cgggtgatag tttctccaca 420 

tttgatactc cttactgcag agtgggtctg ggcatctgct acgacatgcg gtttgcagag 480 

cttgcacaaa tctacgcaca gagaggctgc cagctgttgg tatatccagg agcttttaat 54 0 

ctgaccactg gaccagccca ttgggagtta cttcagcgaa gccgggctgt tgataatcag 600 

gtgtatgtgg ccacagcctc tcctgcccgg gatgacaaag cctcctatgt tgcctgggga 660 

cacagcaccg tggtgaaccc ttggggggag gttctagcca aagctggcac agaagaagca 720 

atcgtgtatt cagacataga cctgaagaag ctggctgaaa tacgccagca aatccccgtt 780 

tttagacaga agcgatcaga cctctatgct gtggagatga aaaagcccta a 831 



<210> 2 
<211> 276 
<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Thr Ser Phe Arg Leu Ala Leu 

1 5 
Lys Ser Asp Asn Val Thr Arg Ala 
20 

Thr Gin Gly Ala Lys lie Val Ser 

35 40 
Tyr Gly Ala Lys Tyr Phe Pro Glu 

50 55 
Ser Thr Gin Lys Leu Ser Glu Val 
65 70 
lie Gly Gly Ser lie Pro Glu Glu 
85 



He Gin Leu Gin He Ser Ser He 

10 15 

Cys Ser Phe He Arg Glu Ala Ala 
25 30 

Leu Pro Glu Cys Phe Asn Ser Pro 
45 

Tyr Ala Glu Lys He Pro Gly Glu 
60 

Ala Lys Glu Cys Ser He Tyr Leu 

75 80 

Asp Ala Gly Lys Leu Tyr Asn Thr 

90 95 
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Cys Ala Val Phe Gly Pro Asp Gly Thr Leu Leu Ala Lys Tyr Arg Lys 

100 105 HO 

lie His Leu Phe Asp lie Asp Val Pro Gly Lys lie Thr Phe Gin Glu 

115 120 125 

Ser Lys Thr Leu Ser Pro Gly Asp Ser Phe Ser Thr Phe Asp Thr Pro 

130 135 140 

Tyr Cys Arg Val Gly Leu Gly lie Cys Tyr Asp Met Arg Phe Ala Glu 
145 150 155 160 

Leu Ala Gin lie Tyr Ala Gin Arg Gly Cys Gin Leu Leu Val Tyr Pro 

165 170 175 

Gly Ala Phe Asn Leu Thr Thr Gly Pro Ala His Trp Glu Leu Leu Gin 

180 185 190 

Arg Ser Arg Ala Val Asp Asn Gin Val Tyr Val Ala Thr Ala Ser Pro 

195 200 205 

Ala Arg Asp Asp Lys Ala Ser Tyr Val Ala Trp Gly His Ser Thr Val 

210 215 220 

Val Asn Pro Trp Gly Glu Val Leu Ala Lys Ala Gly Thr Glu Glu Ala 
225 230 235 240 

lie Val Tyr Ser Asp lie Asp Leu Lys Lys Leu Ala Glu lie Arg Gin 

245 250 255 

Gin lie Pro Val Phe Arg Gin Lys Arg Ser Asp Leu Tyr Ala Val Glu 

260 265 270 

Met Lys Lys Pro 
275 

<210> 3 

<211> 480 

<212> DNA 

<213> Homo sapiens 

<400> 3 

atgtcatgga ggatttcccc tgccacacca tgctgtaggg agttaacttt tcatttgtgc 60 

attttctgtt tggaaacagc ttactgcaga gtgggtctgg gcatctgcta cgacatgcgg 120 

tttgcagagc ttgcacaaat ctacgcacag agaggctgcc agctgttggt atatccagga 180 

gcttttaatc tgaccactgg accagcccat tgggagttac ttcagcgaag ccgggctgtt 240 

gataatcagg tgtatgtggc cacagcctct cctgcccggg atgacaaagc ctcctatgtt 300 

gcctggggac acagcaccgt ggtgaaccct tggggggagg ttctagccaa agctggcaca 360 

gaagaagcaa tcgtgtattc agacatagac ctgaagaagc tggctgaaat acgccagcaa 420 

atccccgttt ttagacagaa gcgatcagac ctctatgctg tggagatgaa aaagccctaa 480 

<210> 4 
<211> 159 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Ser Trp Arg lie Ser Pro Ala Thr Pro Cys Cys Arg Glu Leu Thr 

15 10 15 

Phe His Leu Cys lie Phe Cys Leu Glu Thr Ala Tyr Cys Arg Val Gly 

20 25 30 

Leu Gly lie Cys Tyr Asp Met Arg Phe Ala Glu Leu Ala Gin lie Tyr 

35 40 45 

Ala Gin Arg Gly Cys Gin Leu Leu Val Tyr Pro Gly Ala Phe Asn Leu 

50 55 60 

Thr Thr Gly Pro Ala His Trp Glu Leu Leu Gin Arg Ser Arg Ala Val 
65 70 75 80 
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Asp Asn Gin Val Tyr Val Ala Thr 

85 

Ala Ser Tyr Val Ala Trp Gly His 
100 

Glu Val Leu Ala Lys Ala Gly Thr 
115 120 
lie Asp Leu Lys Lys Leu Ala Glu 

130 135 
Arg Gin Lys Arg Ser Asp Leu Tyr 
145 150 



Ala Ser Pro Ala Arg Asp Asp Lys 

90 95 
Ser Thr Val Val Asn Pro Trp Gly 
105 110 
Glu Glu Ala lie Val Tyr Ser Asp 
125 

He Arg Gin Gin He Pro Val Phe 
140 

Ala Val Glu Met Lys Lys Pro 
155 



<210> 5 

<211> 366 

<212> DNA 

<213> Homo sapiens 



<400> 5 

atgcggtttg cagagcttgc acaaatctac gcacagagag gctgccagct gttggtatat 60 

ccaggagctt ttaatctgac cactggacca gcccattggg agttacttca gcgaagccgg 120 

gctgttgata atcaggtgta tgtggccaca gcctctcctg cccgggatga caaagcctcc 180 

tatgttgcct ggggacacag caccgtggtg aacccttggg gggaggttct agccaaagct 240 

ggcacagaag aagcaatcgt gtattcagac atagacctga agaagctggc tgaaatacgc 300 

cagcaaatcc ccgtttttag acagaagcga tcagacctct atgctgtgga gatgaaaaag 360 

ccctaa 366 



<210> 6 
<211> 121 
<212> PRT 

<213> Homo sapiens 



<400> 6 
Met Arg 
1 

Leu Leu 

Trp Glu 

Ala Thr 

50 
Gly His 
65 

Gly Thr 
Ala Glu 
Leu Tyr 



Phe Ala 

Val Tyr 

20 
Leu Leu 
35 

Ala Ser 

Ser Thr 

Glu Glu 

He Arg 
100 
Ala Val 
115 



Glu Leu Ala 

5 

Pro Gly Ala 

Gin Arg Ser 

Pro Ala Arg 
55 

Val Val Asn 
70 

Ala He Val 
85 

Gin Gin He 
Glu Met Lys 



Gin He 

Phe Asn 

25 
Arg Ala 
40 

Asp Asp 

Pro Trp 

Tyr Ser 

Pro Val 
105 
Lys Pro 
120 



Tyr Ala 
10 

Leu Thr 

Val Asp 

Lys Ala 

Gly Glu 

75 
Asp He 
90 

Phe Arg 



Gin Arg 

Thr Gly 

Asn Gin 

45 
Ser Tyr 
60 

Val Leu 
Asp Leu 
Gin Lys 



Gly Cys Gin 

15 

Pro Ala His 
30 

Val Tyr Val 

Val Ala Trp 

Ala Lys Ala 
80 

Lys Lys Leu 
95 

Arg Ser Asp 
110 



<210> 7 
<211> 507 
<212> DNA 

<213> Homo sapiens 



<400> 7 

atgtcatgga ggatttcccc tgccacacca tgctgtaggg agttaacttt tcatttgtgc 60 
attttctgtt tggaaacagc ttactgcaga gtgggtotgg gcatctgcta cgacatgcgg 120 
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tttgcagagc ttgcacaaat ctacgcacag agaggctgcc agctgttggt atatccagga 180 

gcttttaatc tgaccactgg accagcccat tgggagttac ttcagcgaag ccgggctgtt 240 

gataatcagg tgtatgtggc cacagcctct cctgcccggg atgacaaagc ctcctatgtt 300 

gcctggggac acagcaccgt ggtgaaccct tggggggagg ttctagccaa agctggcaca 360 

gaagaagcaa tcgtgtattc agacatagac ctgaagaagc tggctgaaat acgccagcaa 420 

atccccgttt ttagacagaa gcgaaatatt ttcctgaata tgcagagaaa attcctggtg 480 

aatccacaca gaagctttct gaagtag 507 

<210> 8 
<211> 168 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Ser Trp Arg lie Ser Pro Ala Thr Pro Cys Cys Arg Glu Leu Thr 

15 10 15 

Phe His Leu Cys lie Phe Cys Leu Glu Thr Ala Tyr Cys Arg Val Gly 

20 25 30 

Leu Gly lie Cys Tyr Asp Met Arg Phe Ala Glu Leu Ala Gin lie Tyr 

35 40 45 

Ala Gin Arg Gly Cys Gin Leu Leu Val Tyr Pro Gly Ala Phe Asn Leu 

50 55 60 

Thr Thr Gly Pro Ala His Trp Glu Leu Leu Gin Arg Ser Arg Ala Val 
65 70 75 80 

Asp Asn Gin Val Tyr Val Ala Thr Ala Ser Pro Ala Arg Asp Asp Lys 

85 90 95 

Ala Ser Tyr Val Ala Trp Gly His Ser Thr Val Val Asn Pro Trp Gly 

100 105 110 

Glu Val Leu Ala Lys Ala Gly Thr Glu Glu Ala He Val Tyr Ser Asp 

115 120 125 

He Asp Leu Lys Lys Leu Ala Glu He Arg Gin Gin He Pro Val Phe 

130 135 140 

Arg Gin Lys Arg Asn He Phe Leu Asn Met Gin Arg Lys Phe Leu Val 
145 150 155 160 

Asn Pro His Arg Ser Phe Leu Lys 
165 

<210> 9 

<211> 393 

<212> DNA 

<213> Homo sapiens 

<400> 9 

atgcggtttg cagagcttgc acaaatctac gcacagagag gctgccagct gttggtatat 60 

ccaggagctt ttaatctgac cactggacca gcccattggg agttacttca gcgaagccgg 120 

gctgttgata atcaggtgta tgtggccaca gcctctcctg cccgggatga caaagcctcc 180 

tatgttgcct ggggacacag caccgtggtg aacccttggg gggaggttct agccaaagct 240 

ggcacagaag aagcaatcgt gtattcagac atagacctga agaagctggc tgaaatacgc 300 

cagcaaatcc ccgtttttag acagaagcga aatattttcc tgaatatgca gagaaaattc 360 

ctggtgaatc cacacagaag ctttctgaag tag 393 

<210> 10 

<211> 130 

<212> PRT 

<213> Homo sapiens 
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<400> 10 

Met Arg Phe Ala Glu Leu Ala Gin lie Tyr Ala Gin Arg Gly Cys Gin 

15 10 15 

Leu Leu Val Tyr Pro Gly Ala Phe Asn Leu Thr Thr Gly Pro Ala His 

20 25 30 

Trp Glu Leu Leu Gin Arg Ser Arg Ala Val Asp Asn Gin Val Tyr Val 

35 40 45 

Ala Thr Ala Ser Pro Ala Arg Asp Asp Lys Ala Ser Tyr Val Ala Trp 

50 55 60 

Gly His Ser Thr Val Val Asn Pro Trp Gly Glu Val Leu Ala Lys Ala 
65 70 75 80 

Gly Thr Glu Glu Ala He Val Tyr Ser Asp He Asp Leu Lys Lys Leu 

85 90 95 

Ala Glu He Arg Gin Gin He Pro Val Phe Arg Gin Lys Arg Asn lie 

100 105 110 

Phe Leu Asn Met Gin Arg Lys Phe Leu Val Asn Pro His Arg Ser Phe 
115 120 125 

Leu Lys 
130 

<210> 11 
<211> 459 
<212> DNA 

<213> Homo sapiens 
<400> 11 

atgacctctt tccgcttggc cctcatccag cttcagattt cttccatcaa atcagataac 60 

gtcactcgcg cttgtagctt catccgggag gcagcaacgc aaggagccaa aatagtttct 120 

ttgccggaat gctttaattc tccatatgga gcgaaatatt ttcctgaata tgcagagaaa 18 0 

attcctggtg aatccacaca gaagctttct gaagtagcaa aggaatgcag catatatctc 24 0 

attggaggct ctatccctga agaggatgct gggaaattat ataacacctg tgctgtgttt 300 

gggcctgatg gaactttact agcaaagtat agaaagatcc atctgtttga cattgatgtt 360 

cctggaaaaa ttacatttca agaatctaaa acattgagtc cgggtgatag tttctccaca 4 20 

tttgatactc gtatgtacca gataagtttg cctctttag 459 

<210> 12 
<211> 152 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Thr Ser Phe Arg Leu Ala Leu He Gin Leu Gin He Ser Ser He 

15 10 15 

Lys Ser Asp Asn Val Thr Arg Ala Cys Ser Phe He Arg Glu Ala Ala 

20 25 30 

Thr Gin Gly Ala Lys He Val Ser Leu Pro Glu Cys Phe Asn Ser Pro 

35 40 45 

Tyr Gly Ala Lys Tyr Phe Pro Glu Tyr Ala Glu Lys He Pro Gly Glu 

50 55 60 

Ser Thr Gin Lys Leu Ser Glu Val Ala Lys Glu Cys Ser He Tyr Leu 
65 70 75 80 

He Gly Gly Ser lie Pro Glu Glu Asp Ala Gly Lys Leu Tyr Asn Thr 

85 90 95 

Cys Ala Val Phe Gly Pro Asp Gly Thr Leu Leu Ala Lys Tyr Arg Lys 

100 105 110 

lie His Leu Phe Asp lie Asp Val Pro Gly Lys He Thr Phe Gin Glu 
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115 120 125 

Ser Lys Thr Leu Ser Pro Gly Asp Ser Phe Ser Thr Phe Asp Thr Arg 

130 135 140 

Met Tyr Gin lie Ser Leu Pro Leu 
145 150 

<210> 13 
<211> 858 
<212> DNA 

<213> Homo sapiens 
<40.0> 13 

atgacctctt tccgcttggc cctcatccag cttcagattt cttccatcaa atcagataac 60 

gtcactcgcg cttgtagctt catccgggag gcagcaacgc aaggagccaa aatagtttct 120 

ttgccggaat gctttaattc tccatatgga gcgaaatatt ttcctgaata tgcagagaaa 180 

attcctggtg aatccacaca gaagctttct gaagtagcaa aggaatgcag catatatctc 24 0 

attggaggct ctatccctga agaggatgct gggaaattat ataacacctg tgctgtgttt 300 

gggcctgatg gaactttact agcaaagtat agaaagatcc atctgtttga cattgatgtt 360 

cctggaaaaa ttacatttca agaatctaaa acattgagtc cgggtgatag tttctccaca 420 

tttgatactc cttactgcag agtgggtctg ggcatctgct acgacatgcg gtttgcagag 480 

cttgcacaaa tctacgcaca gagaggctgc cagctgttgg tatatccagg agcttttaat 54 0 

ctgaccactg gaccagccca ttgggagtta cttcagcgaa gccgggctgt tgataatcag 600 

gtgtatgtgg ccacagcctc tcctgcccgg gatgacaaag cctcctatgt tgcctgggga 660 

cacagcaccg tggtgaaccc ttggggggag gttctagcca aagctggcac agaagaagca 720 

atcgtgtatt cagacataga cctgaagaag ctggctgaaa tacgccagca aatccccgtt 780 

tttagacaga agcgaaatat tttcctgaat atgcagagaa aattcctggt gaatccacac 840 

agaagctttc tgaagtag 858 

<210> 14 
<211> 285 
<212> PRT 

<213> Homo sapiens 
<400> 14 

Met Thr Ser Phe Arg Leu Ala Leu He Gin Leu Gin He Ser Ser He 

15 10 15 

Lys Ser Asp Asn Val Thr Arg Ala Cys Ser Phe He Arg Glu Ala Ala 

20 25 30 

Thr Gin Gly Ala Lys He Val Ser Leu Pro Glu Cys Phe Asn Ser Pro 

35 40 45 

Tyr Gly Ala Lys Tyr Phe Pro Glu Tyr Ala Glu Lys He Pro Gly Glu 

50 55 60 

Ser Thr Gin Lys Leu Ser Glu Val Ala Lys Glu Cys Ser He Tyr Leu 
65 70 75 80 

lie Gly Gly Ser He Pro Glu Glu Asp Ala Gly Lys Leu Tyr Asn Thr 

85 90 95 

Cys Ala Val Phe Gly Pro Asp Gly Thr Leu Leu Ala Lys Tyr Arg Lys 

100 105 110 

lie His Leu Phe Asp lie Asp Val Pro Gly Lys lie Thr Phe Gin Glu 

115 120 125 

Ser Lys Thr Leu Ser Pro Gly Asp Ser Phe Ser Thr Phe Asp Thr Pro 

130 135 140 

Tyr Cys Arg Val Gly Leu Gly He Cys Tyr Asp Met Arg Phe Ala Glu 
145 150 155 160 

Leu Ala Gin He Tyr Ala Gin Arg Gly Cys Gin Leu Leu Val Tyr Pro 
165 170 175 
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Gly Ala Phe Asn Leu Thr Thr Gly 
180 

Arg Ser Arg Ala Val Asp Asn Gin 
195 200 
Ala Arg Asp Asp Lys Ala Ser Tyr 

210 215 
Val Asn Pro Trp Gly Glu Val Leu 
225 230 
lie Val Tyr Ser Asp lie Asp Leu 
245 

Gin lie Pro Val Phe Arg Gin Lys 
260 

Arg Lys Phe Leu Val Asn Pro His 
275 280 

<210> 15 
<211> 3093 
<212> DNA 
<213> Homo sapiens 

<400> 15 

ggatggtggg gcatacctgt ggtcccagct acataagagg ctgagacaag aggattgcct 60 

gaactgagta ggtcaaggct gcagtggacc atgtttgtgc cactgcactc cagcctgggc 120 

gacagaacaa ggccctgcct caaaataaaa aatattagct aatggaaagt gattatcata 180 

aaagctaaaa gggaacttta aagaacagaa gaaaagcaaa tatgatgtat agctactacc 240 

tccaggaaga aataagcttg gaagagcccc caacctcctt gctccagggc tgagcacaga 300 

ccttgtcagg gctggctaca taatttgtgg ggcccagttc ccttgttcag atagcaagag 360 

aaaagtgctg ttagcttttc cttctgcagt atctctttca acctctcatg gtgttatttg 420 

ctgtttaatg tcatgttctc ttggacacat gaatacttat ggggtaagtg cagactttta 4 80 

gaggtgcctg ggacccctgt cctgtgaata ggcatgtgtg cagctcactg gctgccaggt 540 

tttccctctg ccagcagcgg gatcgatgtg ctgtgaccca gccagtagtg gggaaactga 600 

gacagacatc ttcccttccc atgagctggg cctgctcatg ggaattatgt gagcagcttc 660 

caaggaatca cactttctgt gctgggacat actcaagtat atggattgga ggtagacgag 720 

aggcccattg aacaaacagt aagggacagg accatattca aacccagtct ttttacttta 7 80 

agccatattc ctcatttcat tcccctacac tgcgtagtaa gaagctggtt cactctagat 840 

tcttgtgcct ggcatgggac tttgcccatg gatattgctc tatctccaga tagattttag 900 

actattgaca ttttggacag gataattctt cgttgtgtta tggagggggt tgtcctatgc 9 60 

attgtaggat gtttggcagt atccttggtc tctattcatt agatgccact catacctcat 1020 

cagttgtggc atcaaaggta tcttcagaca ttgtcagatg tccccccggg gacataactg 1080 

ccttccattt gagaactatg gctctgtctg aatccagcag ttcgatcttc tgatagctgt 114 0 

tttcttttgt ctttgttctc agcccccccc cccccggtag gacccgcggt ccgccggatc 1200 

tccagcgctc agtccgcgcc gcaggtggtg cttgtctgca gagtcatgac ctctttccgc 12 60 

ttggccctca tccagcttca gatttcttcc atcaaatcag ataacgtcac tcgcgcttgt 1320 

agcttcatcc gggaggcagc aacgcaagga gccaaaatag tttctttgcc ggaatgcttt 1380 

aattctccat atggagcgaa atattttcct gaatatgcag agaaaattcc tggtgaatcc 1440 

acacagaagc tttctgaagt agcaaaggaa tgcagcatat atctcattgg aggctctatc 1500 

cctgaagagg atgctgggaa attatataac acctgtgctg tgtttgggcc tgatggaact 1560 

ttactagcaa agtatagaaa gatccatctg tttgacattg atgttcctgg aaaaattaca 1620 

tttcaagaat ctaaaacatt gagtccgggt gatagtttct ccacatttga tactcgtatg 1680 

taccagataa gtttgcctct ttagcaatct cagtagaaga caatcaggta tttatttctt 17 4 0 

ttttgtctct ctccgatttc ttcacataac ctaactgaaa gaccataagt gagaaaggca 1800 

gagaatcatc acagatctgg aaagttcggg cttatttgag aactaaggat ttgacacgat 18 60 

tttgcccttt gatttgattg tagcttcctg ttacggcttc cagagtatac ctattaggct 1920 

acagttgagt acctcccatc tagataataa gcattcaatt agaatgaatt tctcatcttt 1980 

actccgctga tgtaaatgat gtctttatga gatgaagtcc aagtaggaat gagcttgtaa 204 0 

attatctctg tcctcaggtc ctgtgttaat ttatccctgt cagtgttttg tgatcattat 2100 



Pro Ala His Trp Glu Leu Leu Gin 
185 190 
Val Tyr Val Ala Thr Ala Ser Pro 
205 

Val Ala Trp Gly His Ser Thr Val 

220 

Ala Lys Ala Gly Thr Glu Glu Ala 
235 240 
Lys Lys Leu Ala Glu lie Arg Gin 

250 255 
Arg Asn lie Phe Leu Asn Met Gin 
265 270 
Arg Ser Phe Leu Lys 
285 
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gtcatggagg atttcccctg ccacaccatg ctgtagggag ttaacttttc atttgtgcat 2160 

tttctgtttg gaaacagctt actgcagagt gggtctgggc atctgctacg acatgcggtt 2220 

tgcagagctt gcacaaatct acgcacagag aggctgccag ctgttggtat atccaggagc 2280 

ttttaatctg accactggac cagcccattg ggagttactt cagcgaagcc gggctgttga 2340 

taatcaggtg tatgtggcca cagcctctcc tgcccgggat gacaaagcct cctatgttgc 2400 

ctggggacac agcaccgtgg tgaacccttg gggggaggtt ctagccaaag ctggcacaga 24 60 

agaagcaatc gtgtattcag acatagacct gaagaagctg gctgaaatac gccagcaaat 2520 

ccccgttttt agacagaagc gatcagacct ctatgctgtg gagatgaaaa agccctaaag 2580 

tttatgtttc taatgtgtca cagaatagga cgatatgatt ctacaacata atcaactccc 2640 

tattaaattc tttaatgaag aaaaaaaaaa aaaaaaaaaa aaaaaatatt ttcctgaata 2700 

tgcagagaaa attcctggtg aatccacaca gaagctttct gaagtagcaa aggaatgcag 27 60 

catatatctc attggaggct ctatccctga agaggatgct gggaaattat ataacacctg 2820 

tgctgtgttt gggcctgatg gaactttact agcaaagtat agaaagatcc atctgtttga 2880 

cattgatgtt cctggaaaaa ttacatttca agaatctaaa acattgagtc cgggtgatag 2940 

tttctccaca tttgatactc cttactgcag agtgggtctg ggcatctgct acgacatgcg 3000 

gtttgcagag cttgcacaaa tctacgcaca gagaggctgc cagctgttgg tatatccagg 3060 

agcttttaat ctgaccactg gaccagccca ttg 3093 
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